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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: White, Theresa C 

Hindle, Christopher D 

TITLE OF INVENTION: GENETIC CONSTRUCTS AND GENETICALLY 
MODIFIED MICROBES FOR ENHANCED PRODUCTION OF 
BETA-GLUCOS I DAS E . 

NUMBER OF SEQUENCES: 4 

CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fitzpatrick, Cella, Harper & Scinto 

(B) STREET: 1900 K Street, N.W. 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20006-1110 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: CA 09/037,524 

(B) FILING DATE: 10-MAR-1998 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Olsen, Warren E 
(C) REFERENCE/ DOCKET NUMBER: 1039.2500 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-530-1010 

(B) TELEFAX: 202-530-1055 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichoderma reesei 

(ix) FEATURE: 



(ii) 





(A) NAME/ KEY: sig_peptide 

(B) LOCATION: 1..51 

(D) OTHER INFORMATION: /function= "cbhl secretion signal" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 61.. 72 

(D) OTHER INFORMATION: /function= "mature 
beta-glucosidase" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..72 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG TAT CGG AAG TTG GCC GTC ATC TCG GCC TTC TTG GCC ACA GCT CGT 
48 

Met Tyr Arg Lys Leu Ala Val lie Ser Ala Phe Leu Ala Thr Ala Arg 
-20 -15 -10 -5 

GCT CAG TCG GCA GTT GTA CCT CCT 
72 

Ala Gin Ser Ala Val Val Pro Pro 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Tyr Arg Lys Leu Ala Val lie Ser Ala Phe Leu Ala Thr Ala Arg 
-20 -15 -10 -5 

Ala Gin Ser Ala Val Val Pro Pro 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(vi) ORIGINAL SOURCE: 



1 



1 



(A) ORGANISM: Trichoderma reesei 




(ix) FEATURE: 

(A) NAME/ KEY : sig_peptide 

(B) LOCATION: 1..99 

(D) OTHER INFORMATION: /function= "xln 2 signal peptide" 

(ix) FEATURE: 

(A) NAME/ KEY: mat_peptide 

(B) LOCATION: 109.. 120 

(D) OTHER INFORMATION: /function= "mature 
beta-glucosidase" 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..120 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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GTC 


TCC 


TTC 


ACC 


TCC 


CTC 


CTC 


GCC 


GGC GTC 


GCC 


GCC 
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GGC 
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Met 


Val 


Ser 


Phe 


Thr 


Ser 


Leu 


Leu 


Ala 


Gly Val 


Ala 


Ala 


He 


Ser 


Gly 


-36 
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GCC 


GCT 
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GCC 


GCC 


GAG 


GTC 


GAA TCC 


GTG 


GCT 


GTG 


GAG 


AAG 


96 






























Val 


Leu 


Ala 


Ala 


Pro 


Ala 


Ala 


Glu 


Val 


Glu Ser 


Val 


Ala 


Val 


Glu 


Lys 
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CGC 


CAG 


GCT 


AGA 


GTT 


GTA 


CCT 


CCT 
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Arg 


Gin 


Ala 


Arg 


Val 
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Val 


Pro 


Pro 
















(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 4: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Val Ser Phe Thr Ser Leu Leu Ala Gly Val Ala Ala He Ser Gly 
-36 -35 -30 -25 

Val Leu Ala Ala Pro Ala Ala Glu Val Glu Ser Val Ala Val Glu Lys 

-20 -15 -10 -5 



Arg Gin Ala Arg Val Val Pro Pro 

1 



